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Complex networks are mapped to a model of boxes and balls where the balls are distinguishable. 
It is shown that the scale-free size distribution of boxes maximizes the information associated with 
the boxes provided configurations including boxes containing a finite fraction of the total amount of 
balls are excluded. It is conjectured that for a connected network with only links between different 
nodes, the nodes with a finite fraction of links are effectively suppressed. It is hence suggested that 
for such networks the scale-free node-size distribution maximizes the information encoded on the 
nodes. The noise associated with the size distributions is also obtained from a maximum entropy 
principle. Finally explicit predictions from our least bias approach are found to be born out by 
metabolic networks. 
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Networks galore! Representations of real com- 
plex systems in terms of networks range over all 
science from social science, economics, internet, 
physics, chemistry and biology and much more. 
Basically a network is a representation of who or 
what is connected to or influences whom or what. 
It takes the form of some irregular cobweb where 
the parts (the who's or what's) are connected by 
links. The parts are called nodes so the represen- 
tation is in terms of nodes and links (see Fig.l). 
One feature of a network is the number of links 
which are attached to a node: a network can be 
characterized by iV(fc), the number nodes with k 
links attached to them. In many real networks 
one finds that this distribution over sizes k is very 
broad and power law like i.e. N(k) ~ /c -7 . Why 
is this and what does it imply? This is still to 
large extent an open question. Here we adress 
this question using the maximum entropy prin- 
ciple. The predictive value of this principle is 
greatest when it fails! For example, when it was 
found that the measured blackbody radiation did 
have a smaller entropy than the one predicted 
from Maxwells equations and classical statistical 
mechanics, the maximum entropy principle (had 
it been known at the time) would immediately 
tell you that a physical constraint was lacking in 
the theory. This alas turned out to be the Planck 
constant. We use the same reasoning here: we 
first find that the maximum entropy for an un- 
constrained network does have a larger entropy 
than the broad distribution found in real systems. 
So there is a missing contraint! We argue that 
this missing contraint is the advantage (in many 
cases) of maximizing the information encoded on 
the nodes. Thus we are suggesting that the real 
advantage is not to maximize the total number 
of possibilities but rather to maximize the infor- 
mation encoded on the nodes. Our "least bias" 
approach gives explicit predictions for real net- 
works which can be tested. We demonstrate that 



metabolic networks (a network representation of 
the metabolism in a cell) are likely to be a maxi- 
mum information network. 



Complex networks have undergone a rapid surge of 
interest and a number of review articles already exist 
[1][2][3][4]. A striking observation in this field is that 
many real networks have a broad scale-free like degree 
distribution. Why is this and what does it imply for 
the evolution mechanism of the networks? There exists 
many suggestions [lj] 3 [J] . Most suggestions concerns 
steadily growing networks. The most well-known pro- 
posal for an evolution mechanism which produces a scale- 
free network for this case is the preferential attachment 
scenario This mechanism rests on two explicit as- 
sumptions: 1) a link on a new node attaches randomly 
to a node belonging to the network with a probability di- 
rectly proportion to the number of links that are already 
attached to this latter node. 2) once a link is in place it 
stays in place. Although this model mechanism has be- 
come a successful prototype explanation in many cases 
[l| , its applicability is obviously limited by the very spe- 
cific model assumptions. Other suggestions are based on 
various optimization ideas [gL and, finally, some concern 
non-growing networks [7] [§][9j] . 



In the present paper we discuss non-growing networks. 
Such a network has a fixed number of nodes and links 
and the time-evolution is through rewiring of links. First 
we discuss the definition of a random network. Next we 
address the question "What is the least bias needed to 
impose on a random network in order to obtain a scale- 
free distribution?". The possible "least bias" solutions 
are presented together with the corresponding noise. Fi- 
nally characteristic features arising from the " least bias" 
random scale- free network are extracted and compared to 
real metabolic networks. Some speculations are added. 
Our approach is based on information theory and statis- 
tical mechanics [lOj. 
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FIG. 1: Figure showing how an undirected network can be 
described by numbered balls placed in boxes. The balls repre- 
sent link-ends and are distinguishable because each link start 
and end on a specific node. 



RANDOMNESS AND STATES 



In statistical mechanics random means that each pos- 
sible state of a system is equally probable. Consequently, 
one first has to identify what is meant by a state before 
the concept of randomness takes on a precise meaning. 
Here we will follow Ref. [TO] by making the connection 
between states and different distinguishable ways of dis- 
tributing objects. For example the lower right hand box 
in Fig.l contains four distinguishable (numbered) balls. 
There are 4! = 24 ways in which these four balls can 
be distributed into the box. All these ways are distin- 
guishable because the balls are distinguishable and thus 
correspond to 24 different states. This is just like the box 
contained four different compartments each of which can 
contain one ball. 

Figure 1 shows a non-directed network in terms of 
undirected links and nodes. Each link starts and ends 
with a link-end. Let us consider the link-ends on a node. 
In statistical mechanics one could think of the link-ends 
as particles. Since each link-end on a particular node 
is connected to another link-end on a different node, it 
means that the link-ends on a node are de facto distin- 
guishable. Thus the closest analogy with statistical me- 
chanics is that of boxes (nodes) containing distinguish- 
able balls (link-ends). A first description of the system is 
then in terms of balls in boxes [l2[ • In addition there 
are constraints imposed by the network topology. Typical 
constraints are no loops (links starting and ending on the 
same node), at most a single link between two nodes, and 
keeping the network connected. A necessary (but not suf- 
ficient) constraint corresponding to connected networks 
is that each box always contains at least one ball. The 
other two constraints are often unimportant and occurs 
with a vanishing probability in the limit of large random 
networks [2[. However, sometimes the no- loop constraint 
is crucial, as will be discussed further on. 



II. BALLS IN BOXES 

Let us start with randomly distributing M distinguish- 
able balls into N boxes. The total number of ways of 
picking the balls are Ml and the outcome is a distribu- 
tion into the N boxes where N(k) boxes has k balls. Since 
all the balls are distinguishable all boxes with k balls are 
distinguishable and consequently the number of distin- 
guishable ways of distributing the balls into the boxes, 
ft, are given by 



lnft = lnM!-^lnN(/c)! (1) 

k=0 

~ MlnM - M - ^2N(k)[lnN(k) - 1] (2) 

The degeneracy expressed by N(k)\ reflects the triv- 
ial fact that two boxes which contain the same number 
of balls have indistinguishable sizes. The connection to 
statistical physics is the one-to-one correspondence be- 
tween distinguishable ways o f distributing the balls and 
the states of the system [10]. The unbiased estimate 
of the total number of different states ft is given by 
the maximum of In £7 with respect to the distribution 
N(k), subject to the constraints J^fcLi N(k) = N and 

kN ( k ) = M - This 

gives by variational calculus 
— lniV(fc) + a — bk = with the solution 



N(k) = Aexp(-bk) 



(3) 



[a and b are the Lagrange multipliers corresponding to 
the constraints and A is a normalization constant corre- 
sponding to N(k) = N]. This is the exponential dis- 
tribution which within statistical physics in other vari- 
ables can be recognized as the well-known Boltzmann 
distribution. 

Suppose now that the balls instead are indistinguish- 
able. In this case the boxes can only be distinguished if 
the number of balls they contain are different. Compared 
to the case with distinguishable particles this means that 
all ways of distributing k balls into a box are indistin- 
guishable or degenerate. We define the degeneracy f(k) 
as the number of of ways of distributing balls into a box 
which are indistinguishable. This means that f(k) is the 
total number of of ways of distributing k different par- 
ticles into a box divided by the total number of distin- 
guishable states which the box can have. In the present 
example this is obviously f(k) = k\/l. Consequently to- 
tal number of distingushable state are now 



In ft = In M! - ^lniV(fc)! - ^ N(k) ln/(jfe)] 

k=0 k=0 

-MlnM - M -^2N(k)[lnN(k) -1 + lnfe!] (4) 



k=0 



and the unbiased estimate is again given by maximiz- 
ing ft with respect to the distribution N(k). The varia- 
tional solution in this case gives — In N(k)— In k\+a— bk = 
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0, and thus N (k) ~ ex.p(—bk) jk\ ~ const k /k\. This is the 
Poisson distribution which in the context of networks is 
associated with the random Erdos-Renyi (ER) network. 

As mentioned above a connected network corresponds 
to the case when all boxes contain at least one ball. How 
does this constraint change the distribution? It corre- 
sponds to first filling all the boxes with precisely one 
ball. The number of ways of doing this is M\/(M — N)\. 
The number of ways to fill in the remainder is given by 

ln(M-TV)! - J2 lnN ( k ) 1 - ^2 N ( k ) ln f(k). (5) 



k=i 



k=l 



Thus the crucial question is how the degeneracy f(k) 
changes. Let us start with ER-case and indistinguishable 
balls. The degeneracy f{k) is this time given by the num- 
ber of distinct ways to fill a box with distinguishable balls 
divided by the number of distinguishable ways to fill a 
box will the remaining k — 1 balls. For indistinguishable 
balls this is simple. There is only one distinguishable 
way to fill a box with k — 1 indistinguishable balls. Con- 
sequently, f(k) = y an d the distribution has again the 
Poisson-distribution form. 

The case of distinguishable balls is in this respect quite 
different: the number of distinguishable ways to fill a box 
with k — 1 distinguishable balls is obviously (k — 1)! and 



consequently the degeneracy is now f(k) 



k\ 
(k-iy. 



. Thus 



the total number of distinguishable ways of distributing 
distinguishable balls subject to the constraint that all 
boxes contains at least one ball is given by 

In n = In Ml - ln N ( k ) ! - N W ln k 

k k 

~M\nM -M -^N(k)[\nN{k)-l + \nk\. (6) 

k 

The unbiased estimate of the total number of differ- 
ent states Q is given by the maximum of \nQ with re- 
spect to the distribution N(k), subject to the constraints 

J2k=i N ( k ) = N and EfcLi kN ( k ) = M which b y varia- 
tional calculus gives — ln N(k) — ln/c + a — bk = with 
the solution 



N(k) 



Aex.p(—bk) 



(7) 



By this reasoning an unbiased random connected network 
has a degree-distribution of the form 



P(k) 



N(k) exp(-bk) 



N 



k 



(8) 



To sum up: We use the correspondence between the 
different ways to distribute balls and states in statistical 
mechanics. Taking this into account gives more possible 
states for distinguishable than for indistinguishable balls. 
The link-ends on a node correspond to distinguishable 
balls. An unbiased estimate assumes that all these states 



are equally probable. This leads to a degree-distribution 
P(k) ~ exp(— bk)/k which is distinctly different from the 
Poisson distribution. 

How close is the analogy between distinguishable-balls- 
in-boxes (DBB) model and a network? If we define a 
link by its link-ends such that an enumeration of them 
are given by (1, 2), (3, 4), (M - 1, M) then the map- 
ping is precise. However, not all possible distributions 
of links fulfill the requirement for a network. As pointed 
out above, the difference imposed by the network con- 
straints on the states of the DBB-model are often of mi- 
nor importance, except for the no- loop constraint which 
can be crucial. An example when the no-loop condition 
makes a crucial difference between the DBB-model and 
the corresponding network is the star-like networks (or 
more generally networks which contain nodes of order 
M): the number of possible states for such networks are 
significantly smaller in comparison with the number of 
the corresponding DBB-states. 

We note that a necessary but not sufficient (because 
the orderings of the objects in a box corresponds to dif- 
ferent states) condition for two states to be equal is that 
they contain precisely the same boxes provided a box is 
identified by the specific balls it contains; for two equal 
DBB-states all the boxes corresponding to the one state 
can be put on top of identical boxes corresponding to the 
other. Likewise for two equal states of a network: the 
two networks corresponding to the equal states can be 
put on top of each other so that all nodes and all links 
precisely match. 

How should one actually fill the boxes with balls in 
case of the constrained DBB-model? One starts with 
one ball in each box. Next one randomly chooses one of 
the remaining M — N balls and add another ball in an 
arbitrary box. This box is chosen with the probability 
p(k = 1)^1, p(k > 1) ~ k — 1 in order to ensure that all 
the possible states are given the same chance. It follows 
that 



P (i) = 

p(k > 1) = 



1 



N(l) + - l)JV(fc) 

k-1 

N(l) + Z(k-l)N(k) 



Throwing the balls into the boxes according to the 
above probability distribution gives the unbiased distri- 
bution P(k) ~ ex.p(—bk)/k (for k > 1). If the balls are 
already in the boxes and one wants to choose all possi- 
ble states with equal probability, then one have to use 
the procedure of choosing two balls randomly and then 
move the one to the same box as the other [13]. We 
also note in passing that for the unconstrained DBB- 
model, the probability reduces to p(k) ~ k which has the 
form of a preferential attachment to boxes. However, in 
our context it arises as a direct statistical consequence of 
distinguishable balls and in fact represents the unbiased 
situation with no preference what so ever! 
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III. BOX INFORMATION AND 
SCALE-FREENESS 

So far we have argued that the random distribution 
for a network is given by P(k) ~ exp(— 6fc)/fc. What 
type of bias in the sampling of different states are nec- 
essary for changing the degree-distribution to a power 
law P(k) ~ k 1 and what is the least bias necessary? To 
this end we consider the box information (the informa- 
tion contained within a box) of the DBB-model. The 
box (or "useful") information, s^ n , contained in a box 
with distinguishable particles is given by the logarithm 
of the number of possible different orderings of the balls 
within the boxes. For the unconstrained DBB model this 
is Si n = In A:!, while for the constrained DBB-model it is 
Si n — m (/c — 1)!- The total box information for the un- 
constrained DBB-model is hence Si n = ^2 k N(k) In k\. 
Note that this is the maximum box information you 
can store for a given distribution N(k). The global 
maximum is trivially the case when all balls are in the 
same box i.e. N(M) = 1 and N(k < M) = so that 
S^ ax = InM!. For the constrained DBB-model we in- 
stead have Si n = N(k) ln(fc — 1)! with the global 
maximum £™ ax = ln(M - N + 1)! for N(M - N + 1) = 1 
and N(l) = N — 1. The difference between the total 
box information for the unconstrained DBB-model, i.e. 
the maximum box information you can store for a given 
distribution N(k), and the total box information for the 
constrained DBB-model (divided by the number of boxes 
to get an intensive quantity) is 

AS in = ^J2 N ^ lnk ' W 

k=i 

Thus small ASi n means large box information. It 
seems plausible to us that a bias towards large box in- 
formation could be favored in various contexts and will 
hence consider how such a bias will affect the network 
degree-distribution. 

In order to find the distribution N(k) which corre- 
sponds to the smallest ASi n we do a variational calcu- 
lation: In addition to the basic constraints for a non- 
network, ££ =1 P ( k ) = 1 and Ek=i kN ( k ) = 
M/N we then also have the constraint ASi n [P(k)] = B. 
The fixed information value, B, introduces a bias on the 
purely random states. This biased Pe(fc) is obtained by 
maximizing the number of different states, subject to the 
three constraints. These constraints are handled by three 
Lagrange multipliers and the solution is the one which 
maximizes 

- ^2 Pb (k) [In PB(k) + a + bk + c In k] (10) 

k 

with respect to the distribution Pg(fc). This leads to 

p B ( k) = (id 




c (N,3) c (N,2) 1 c (N,3) c (N,2) 



c c 

FIG. 2: Figure showing how the Lagrange multiplier b (Fig. a) 
and the box information difference B (Fig. b) depend on c for 
two different average number of balls (2 and 3) and N = 10 6 . 
Here, c (iV,2) = 2.41 and c (N,3) = 2.15. 



^2 k= i PB{k) = 1 we directly get A(b,c) and the re- 
maining constants b and c are obtained from the re- 
maining two constraints. Figure [2](a) shows how b and 
c depend on the ratio M/N. The possible solutions 
ranges from (1 < c < Cq, b > 0) to (c = cq, b = 0) 
i.e. from P(k) ~ exp(— bk)/k c to P(k) ~ /c _c °, where 
c = c (AT, M/N). [Note that c > 2 for large N because 
of the normalization condition]. Using the explicit form 
of the solutions it is easy to show that for M balls and 
TV boxes the scale-free distribution corresponds to the 
smallest B and hence to the largest box information for 
the "least bias" solutions (see Fig. 12(b)) [11]. 

The relation between the global maximum solutions 
mentioned above (which are not obtained within varia- 
tional calculus) and networks will be discussed in the fol- 
lowing. Our conclusion will be that the scale-free network 
corresponds to the global maximum of node information. 



IV. SCALE-FREENESS AND NOISE 

The degree distributions P(k) are average distribu- 
tions. This means that, if pi is the probability for a possi- 
ble distinct system state z, then the average distribution 
P(k) is given by P(k) = *}2iPiNi(k)/N = (n(k)), where 
the sum is over all box distributions corresponding to 
the system states i (denoted by ()). We have so far only 
characterized the system by the average box distribution 
(n(k)}. Our goal is to find as detailed characteristics 
as possible for the maximum node-information random 
scale-free network, in order to be able to decide whether 
or not a particular network obeys the statistical proper- 
ties implied by this particular random scale- free network. 
To this end we also derive the statistical deviations from 
the average distribution (n(k)). This statistical deviation 
is measured by the noise 

£<(n(fc) - (n(fc)) 2 ). (12) 

k 



where the constant A, 6, and c are given by the 
three constraints. From the normalization condition 



The actual calculation follows the same steps as the 
variational calculation of the maximum box information 
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scale- free network in the previous section. Again we ad- 
dress the problem of imposing the fixed information con- 
straint A Si n = B into the DBB-model 

B = ^P(fc) lnfc = (%2n(k) Ink) = ^Pi^n^k) Ink 

k i k 

(13) 

Thus our constraints are now a fixed value B, YliPi = 1 
and ^2 i Pi rii(k)k = M/N. We want to maximize the 
average number of distinct states ft or equivalently 



In £7 = const — ^^Pi ^]nj(k) \n[rii(k)k] 



(14) 



(see Eq. ©). With no bias the problem just corresponds 
to maximizing the entropy — X^.P* m P* subject to the 
constraint pi = 1 [l0[ : the unbiased value of the prob- 
ability for a state is the maximum of — pi [In pi+ const] 
with respect to variations of pi. The answer is obviously 
that all pi are equal i.e. all system states are equally 
probable. Note that the variation is now with respect to 
the probabilities pi of the system states z, whereas in the 
previous sections it was with respect to the average dis- 
tributions P(k). For fixed values of B, Q and M/N the 
problem instead corresponds to finding the maximum of 
the expression (where a, /?, 7 and 5 are Lagrange multi- 
pliers). 

- ^2pi[\npi n ^ k ) m k + P ^2 n ^ k ) ln[ni(fe)fe] 

i k k 

+ -y'$2ri i (k)+5 y 52n i (k)k], (15) 

k k 

with respect to variations of pi. We here use different 
symbols for the multipliers than in previous sections in 
order to emphazise that the variations are with respect 
to a different variable. The result is straightforwardly 
obtained and gives the probabilty for a system state i as 
an exponential pi ~ ex.p(—H/T) where 



— = - a n i( k ) H k ) H n i( k ) k ] 

k k 

■<y^2ni(k) + 5^2ni(k)k 

k k 



(16) 



Here we used the notation ^ for the function in the 
exponent in order to display the equivalence with sta- 
tistical physics and the Boltzman factor: in statistical 
physics the probability of a system state i is given by 
Pi = Z~ x exp(— H/T) where H is the hamiltonian, T 
is the temperature and Z~ x is a the normalization con- 
stant determined by the condition ^2 i Pi = 1. In sta- 
tistical physics Z is called the partition function. The 
average value of any quantity 0(n(k),k) is given by 
exp(— H/T)0(n(k), k)) where the brackets means 
the average over all different states. For the DBB-model 
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FIG. 3: Distribution of box-sizes obtained by Monte Carlo 
simulations where the Hamiltonian from Eq. [17] was used. 
N = 1000, 70 = 2.5 and the results were averaged over 2000 
runs. Figure (a) and (b) shows the results for an average 
number of balls equal to 1.56 and T reaching infinity and zero 
respectively. In (c), T is also close to zero but M/N = 2 and 
boxes (nodes) of order M is forming. Note that the Scale-free 
distribution (b) has the smallest noise. 



the two last constraints in Eq. ([T6|) (i.e. constant number 
of boxes and balls) are included already in the definition 
of the model, so the Hamiltonian H reduces to 



H 
f 



— n(k) In n(k) + 70 n(k) In k] 



(17) 



with T = 1//3 and 70 = (/3 — a)//3. This system state 
probability and Hamiltonian in the network context was 
first derived and investigated in Ref. [l3[. In particular 
Monte- Carlo simulations are easily implemented follow- 
ing the standard proceedure in statistical physics: choose 
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two random balls and then move one to the same box 
as the other with a probability given by the Boltzmann 
factor exp(— H/T). Figure Eta) shows the unbiased es- 
timate obtained from MC simulations which obviously 
corresponds to T = oo and also a = (= no bias). 
This is just the unconstrained DBB-model and the av- 
erage distribution is of the form P(k) ~ exp(— £fc)/fc as 
previously shown. The new characteristics is the statis- 
tical deviations from the average distribution, the noise, 
associated with the least bias (which in this case is no 
bias at all). The least bias always corresponds to the 
smallest ratio a//3 in Eq. ([T6]) and hence to the largest 
70 in Eq. (fT7|) for which the average solution of the form 
P(k) ~ exp(— 5k) /k 1 with 1 < 7 < 70 is obtained. It 
turns out that for each given ratio M/N there is pre- 
cisely one such largest value 70 which is given by the 
condition 

C = ^=1 ™ = M/N (18) 

(note that 70 = cq(N,M) where cq(N,M) is defined in 
the preceding section). This also means that the statis- 
tical noise for the whole sequence of least biased solu- 
tions is uniquely given. Since 70 is unique the noise is 
just given by the "temperature" T : large noise corre- 
sponds to large T and small noise to small, which is, of 
course, precisely what you would expect from statistical 
physics. The scale-free distribution is obtained in the 
limit of small T and consequently has the smallest noise. 
Figure [3jb) shows the average distribution and the noise 
obtained for a low T. The important point is the connec- 
tion between the scale-free distribution and the smallest 
noise. 

What do the two cases C > M/N and C < M/N 
correspond to? In the first case the average distribution 
is of the form P(k) ~ exp(— 5k) /k 1 with 5 > 0. So 
the solutions overlap with the ones obtained for the least 
bias. The point is that these solutions are obtained for 
a smaller 70 and hence for a larger bias. This means 
the same P(k) but a smaller noise. The case C < M/N 
means that 70 is larger than for the least bias solutions. 
In this case boxes with of order M balls are created. An 
example is given in Fig. El^c). Note that these solutions 
with order M nodes are not picked up by variational 
calculus. 

For the network, the presence of nodes of order M 
greatly constrains the number of possible different states 
relative to the DBB-model, because of the no-loop con- 
straint. For example a perfect star with TV nodes and 
M = 2(N — 1) link-ends [ N — 1 single nodes attached to 
a central node of degree N — 1)] for the constrained DBB 
model corresponds to the box information 

AS in = p (k) m fc ^ \n(N - 1) - (19) 

k=l 

for large TV. However, only one of these states is consis- 



tent with the network requirement so that 

AS in - [M In M - N In N] « ln( TV) . (20) 

For a network this implies the solutions containing or- 
der M nodes will always have much smaller node infor- 
mation than the corresponding scale-free network. 

Based on these observations, we conjecture that the 
scale- free network maximizes the node information. 



V. CONSEQUENCES FOR DIRECTED 
NETWORKS. 

So far we discussed undirected networks. However, also 
directed networks has a one-to-one mapping to the con- 
strained DBB-model. In this case a link is again de- 
fined by two balls: if the balls are enumerated 1, 2, , , M 
then the links are enumerated by two consecutive num- 
bers 12,34,,, M — 1M where the first number (which 
is always odd) denotes the start of the link and the last 
(even numbers) denote the end of the link. This does 
not change anything as to the number of different ways 
you can distribute the balls (or links) among the boxes 
(nodes). Thus the scale- free network should again be the 
one which maximizes the node information. Note that 
the direction of the links does not affect the amount of 
node information which a node can carry. Consequently, 
from the point of view of node information the directions 
of the links attached to a node are completely random. 
Thus a network, which is only optimized with respect 
to the node information, acquires characteristic random 
features with respect to the distribution of in- going and 
out- going links attached to a node. 

Some of these characteristic random features for in- 
and out-\mks attached to a node are as follows: Suppose 
that the numbers of the in-, out-, and total numbers of 
links connected to a node are fc^ n , k out , and k , respec- 
tively, so that ki n + k out = k. The average number of 
m-links (ki n ) out for for nodes with precisely the number 
of k out out-\inks are then given by the binomial coeffi- 
cient B(ki n ,k) i.e. the probability to get ki n tails when 
tossing a coin k = k- in + fc out times: 

. Eti::^ B(k - Kut, k)p(k)(k - kout) 
{ )kout ~ Etz: t k -B(k-k ouu k)p(k) ■ 

(21) 

For a the case of P(k) ocl/fc 7 one then finds (k[ n } out « 
fc out [14]. The result (ki n ) out « fc ou t may look innocent, 
but it is completely non-trivial, as realized when com- 
paring to the ER-network: in this latter case there is no 
correlation and (fcm)out = const no matter what size k ou t 
one chooses. Another characteristic feature is the spread 
of the distribution of fci n -links for the nodes with a given 
number of k out . For this spread we use the measure 

S in (k out ) = E(fc - |fc - ) J fc ;"" (fcm)fc °" tl (22) 
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which in terms of the binomial coefficient becomes 

Sin{kout) — 

Etz: t kout B ( k - ^ fc ) p ( fc ) ggff^; 
ztz~ kout B(k-k out ,k)p(k) 

]/2 

This gives 5i n (femt) oc fc out / ,whereas for the ER- 
network the spread is independent of fc out i.e. Si n (k out ) = 
const [3]. From symmetry one has Pi n (k) = P out (k). 
Using the relation (kin) out ~ femt one can motivate the 
(at least) approximate relation 

Ptot{k)~P in {\) = Pout{\) (24) 

for even fe where P £o t, Pin and P on £, are the size dis- 
tributions of the tot-, in-, and o^-links, respectively. 
This means that all three distributions are described by 
the same functional form P(k) = Ptot(k) ~ -Pm(f) = 
^out(f)- The motivation steps are as follows: 

Pin(kin) — Ptot(kout + fen) ~ Ptot((k ut}\k in + fen) 

~P, o£ (2fe n ) (25) 

Taking into account that P to t{2k) only has half as 
many points as ^2 k Pin(k) fixes the normalization con- 
stant to 2 and thus leads to the relation 

2P tot (2k) = P in (k) (26) 

The general connection between the total- and in-, o^£- 
link distributions, 

Ptot(feot) = E Pin(kin)P(k ou t): (27) 

is then of the form 

k 

2 P(2fe) = Y, p (Q) p ( k -Q) for k^2. (28) 

Metabolic networks: These three characteristics 
can now be used to compare with real networks. As an 
illustration we choose metabolic networks: [14[ The data 
shown in Fig. [H are average properties of 107 metabolic 
networks with the average size (N) w 940 (data taken 
from Ref. [IHOil). A metabolic network is constructed 
as follows: Substrates and products in the metabolism 
are nodes. Two nodes are connected if the substance of 
one is a substrate in a metabolic reaction which produces 
the substance represented by the other node. The links 
points from the substrate to the product. The data are 
obtained as the average over 107 such networks and con- 
sequently reflect an ensemble average network structure 
associated with metabolic networks. Figure [11(a) shows 
that metabolic networks have P ou t(k ut) — ^m(fen) and 




^out 



FIG. 4: Metabolic Networks: Average over 107 metabolic net- 
works with data obtained from Ref. ([H]). The same char- 
acteristics as for Model A and Friendly Merging: a) Cumu- 
lative degree-distribution P(> k). The dashed straight line 
line has the slope 7 = 2.3 and (kin) = (k ou t) ~ 4.3. b) Plot 
of (kin) out versus k ou t showing that the data is consistent 
with (kin) out — kout- c) Demonstration that the spread goes 
as Sin(kout) oc k~ut t° reasonable approximation, full line 
from Eq. (|23[) and data points from simulations. The cut- 
off for large k ou t is a finite size effect, d) demonstrates that 
the relation 2P to t(2k) = P% n (k) = P ut(k) is fulfilled as ex- 
pected for a maximum-node-information network (note that 

^(>^) = E^ max ^))- 



also have a broad scale-free degree-distribution as was 
first demonstrated in Ref. [17[. In addition from Fig. 
IH(b) and (c) we conclude for the ensemble average of 
metabolic networks the relation (k in ) w k out holds to 
good approximation in accordance with the maximum 
node-information requirement. Likewise the spread has 
a similar decrease as required by maximum node infor- 
mation. Furthermore, Fig. ffl(d) shows that the relation 
2P to t(2k) = Pin(k) = Pout(k) holds to excellent approxi- 
mation [in accordance with standard practice in network 
analysis contexts, it is in Fig. 21(d) expressed in terms of 
the cumulant distribution P(> k) = E/T" p ( k )}- Tnus 
ensemble averages over metabolic networks show every 
sign of belonging to a maximum node-information net- 
work. 



VI. CONCLUDING REMARKS 

In this paper we introduced and studied the con- 
strained DDB-model. This model has a one-to-one map- 
ping to a network provided the network is allowed to 
have loop-links and to be disconnected. We showed that 
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the maximum box information state for the constrained 
DBB-model is a scale-free size-distribution of the boxes 
described by a power-law with an exponent —7 with 
7 > 2, provided the possibility of having boxes with a fi- 
nite fraction of the balls are excluded (exclusion of order 
M boxes). Next we observed that the presence of order 
M nodes is effectively suppressed for connected networks 
with no loop-links. This led to the conjecture that the 
scale-free network gives the maximum node information. 
This conjecture in turn leads to explicit characteristic fea- 
tures for the scale-free maximum node-information net- 
work. We showed that an ensemble of metabolic networks 
has these features and concluded that metabolic network 
appears to have evolved in a way which maximizes the 
node information. The obvious question is then "Why?". 



We have at present no answer to this apart from the gen- 
eral observation that keeping as many options as possible 
increases the chances to adapt to new conditions. Neither 
has it escaped our notice that in the case of metabolic 
networks the ordering of the link ends on a node corre- 
sponds to a time-ordering and that further a particular 
time-ordering is required for creating a specific functional 
metabolic pathway. If this is the case one might specu- 
late that maximum node information corresponds to the 
maximum number of potential metabolic pathways. 
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